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1. INTRODUCTION 

The log-Sobolev inequality asserts that, in any dimension n and for any smooth 
enough function /: R' ! —» R* := (0, +oo), it holds 

i r |v /] 2 

(1.1) Ent ,„(/)<- '-j~dy n , 

^ Jr" / 

where y n (dx) = ip n (x)dx (2tt) _ 2 expj ~'-f-}dx, x e R", is the standard Gauss¬ 
ian measure with density <p„, |x| = Jt^=i x j stands for the Euclidean norm of 
x = (xi,..., x n ) (accordingly |V/| is the Euclidean length of the gradient) and 
Ent yn if) \= £„ f log f dy„ - f dy n log jl„ f dy n is the entropy of / with respect 
to y n . The constant 1/2 is optimal. Moreover, equality holds in (1.1) if and only 
if / is the exponential of a linear function, i.e. there exist a e R", be R such that 
f(x) = expja ■ x + b], x e R". For simplicity we may write ip and y for p\ and y\. 

The log-Sobolev inequality above goes back to Stam [36] in the late fifties. Later 
Gross, in his seminal paper [24], rediscovered the inequality and proved its funda¬ 
mental equivalence with the so-called hypercontractivity property, a notion used 
by Nelson [33] in quantum filed theory. Since then the log-Sobolev inequality at¬ 
tracted a lot of attention with many developments, applications and connections 
with other fields, including Geometry, Analysis, Combinatorics, Probability The¬ 
ory and Statistical Mechanics. We refer to the monographs [1, 2, 25, 39, 27, 31] 
for an introduction. Finally, we mention that equality cases, in (1.1), appear in the 
paper by Carlen [12]. 

Very recently there has been some interest in the study of the stability of the 
log-Sobolev inequality (1.1). Namely the question is: can one bound the difference 
between the right and left hand side of ( 1.1 ) in term of the distance (in a sense to 
be defined) between / and the set of optimal functions? In other words, can one 
bound from below the deficit 

1 r |v /] 2 

(E2) 6 LS (f):=- '-^dy n -Ent yn (f) 

Jr" j 

in some reasonable way? We refer to [26, 7, 16] for various results in this direc¬ 
tion. What is however currently lacking in the aforementioned literature is a result 
stating that, in fact, SLs(f) R d(f,0) where O := {e a x+b ,a e R", b e R] is the set 
of functions achieving equality in (1.1) and where d is some distance. 

Our aim is to give a result in this direction, using a distance introduced by Bucur 
and Fragala in [ 1 ] that we recall now. 

Bucur and Fragalci ’s construction of a distance modulo translation. In this section 
we recall the procedure of Bucur and Fragala [11] to define a distance (modulo 
translation) in dimension n starting with a distance (modulo translation) in dimen¬ 
sion 1. We first give the definition of a distance modulo translation. 

Let S n be some set of non-negative functions defined on R”. A mapping m : 
S n x S n —> R + is said to be a distance modulo translation (on S n ) if (;) m is 
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symmetric, (;;) it satisfies the triangular inequality and (/'/'/') m(u, v) = 0 iff there 
exists del" such that v(x) = u(x + a) for all x e R". 

Now, given a direction / € S' !_1 (the unit sphere of R"), let x = (x',t/) be 
the decomposition of any point x € R' ! in the direct sum of the linear span of £■' 
and its orthogonal hyperplane He := {v 6 R” :< £,y >= 0} (here < > stands 

for the Euclidean scalar product). Then, for all integrable function f:R n —> R, 
define ff. R —» R, t i-» _/£(?) := f H fix', t())d'H n ~ 1 (x'), where ’H" 1 is the (n - 1)- 
dimensional Hausdorff measure on Hg. Given a distance m modulo translation on 
some set S of non-negative real functions, set 

Sn := {/ € l\I 
and, for f,g e S n , 


i + ), xf (x) e L^R”, R"), U e s for a11 ^ e 


m -11 


:= sup m(f 0 g ( ). 

fsS "- 1 

In [1 1, Corollary 2.3], it is proved that m n is a distance modulo translation on S n . 

Also, Bucur and Fragala [11] introduce the following distance modulo transla¬ 
tion that we may use in the next sections. Set 


& :={ u : 


/■ 


: continuous and I u(x)dx - 1 


Given two probability measures fj.(dx) — u(x)dx, and v(dx) = v(x)dx, u,v € B, 
set T = F~ l o F/j, where F fl (x) := f* u(y)dy and F v (x) := f* v(y)dy are the 
distribution functions of ju and v respectively (observe that, since u € (B, F~ 1 is 
well defined and so does T (note that T is increasing)). T is the transport map that 
pushes forward p onto v, i.e. the mapping satisfying j : MT jd/j = [' hdv for all 
bounded continuous function h. The following is a distance modulo translation on 
1 B (see [11, Proposition 3.5]) 


(1.3) 


d(u,v ) := 


:= f 11 - 1 " 1 4 , 

J max(l,r') M 


We denote by d n and (B n the distance modulo translation and the set of functions 
constructed by the above procedure, starting from d and B in dimension 1. 


2. Stability of the Log-Sobolev inequality 

In order to state our main result, we need first to give a precise statement for 

(1.1) to hold. 

It is well-known that ( 1.1 ) holds for any / such that J R „ lfldy,,+f R „ - 'jr~dy n < oo, 
i.e. |/| 1/2 6 H l (y n ), see e.g. [10, Chapter 1]. By a density argument one can restrict 

(1.1) , without loss, to all / positive (since |V|/|| = |V/| almost everywhere), and 
by homogeneity, we can assume furthermore that f.„ f dy n = 1. We call Jl n the 

set of C 1 functions /: R” —» R* such that J fn f dy n = 1 and J R ,„ — 'j-dy n < oo. It 
is dense in the set of all functions satisfying the log-Sobolev Inequality (1.1) and 
is contained in (B n . We observe that the set of extremal functions (with the proper 
normalization) [exp{<7 ■ x - € R"} is contained in J\. n . 
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We are now in position to state our main theorem (recall the definition of d„ 
from the previous section). 

Theorem 2.1. For all n and all f e 2R n it holds 

1 2 

$Ls(f) > ^d n (ftp n ,(p n ) ■ 

Before moving to the proof of Theorem 2.1 which is very short and elementary, 
let us comment on the above result. 

First, from the above result, we (partially) recover the cases of equality in the 
log-Sobolev inequality for the Gaussian measure [ 2]. Indeed, / e Fl n achieves 
the equality in the log-Sobolev inequality iff dLsif) = 0 iff ftp,, is a translation of 
tp n iff fix) = exp[-n • x - 4j-} for some a e R". This is only partial since Theorem 
2.1 do not deal with all functions satisfying the log-Sobolev inequality but only 
with / € Fi,,. There is in fact some technical issues here: the distance d is no 
more a distance modulo translation if Ff x (pi is the one dimensional probability 
measure with density /) is not absolutely continuous (a property that is guaranteed 
by the fact that, in the definition of Fl n , we impose the positivity of the functions), 
see [11, Remark 3.6 (i)]. Hence, a result involving the distance d n cannot recover, 
by essence, the full generality of Carlen’s equality cases [12]. However, Fi„ is 
very close to cover the set of all functions satisfying the log-Sobolev inequality (in 
particular it is dense in such a space) and, to the best of our knowledge, there is 
no result in the current literature that gives a lower bound of the deficit involving a 
distance without any second moment condition. 

The assumption / of class C 1 , in the definition of Fi n can certainly be relaxed. 
Indeed, one only needs, in dimension 1, that F~ [ is an absolutely continuous func¬ 
tion [11, Proposition 3.5] (for d/i(x) = f(x)tp(x)dx, x e R). In dimension n, such a 
property should hold for all directions f e §”“ 1 . For this reason, and as mentioned 
above, there is no hope to obtain the whole family of functions satisfying the log- 
Sobolev Inequality. Hence, we opted for an easy and clean presentation rather than 
for a more technical one (a weaker assumption on / would have led us to technical 
approximations in many places, that, to our opinion, play no essential role). 

We also observe that our result does not capture the product character of the 
log-Sobolev inequality. Indeed, if one considers, on R", a function of the form 
f(x) = h(x\)h(x 2 ) ... h(x n ), x = (jq,..., x n ), with h: R —> R*, then it is not dif¬ 
ficult to see that 8 is if) is of order n (i.e. 8 is if) - n8is(h )) thanks to the ten- 
sorisation property of (1.1) (see e.g. [1, Chapter 1]), while d n ( ftp„,tp n ) is of order 
1 (i.e. d n (ftp n ,<p n ) = d(htp,tp)). This mainly comes from our use of Bucur and 
Fragala’s quantitative Prekopa-Leindler Inequality which is also, by construction, 
1 dimensional. See below for some results based on the tensorisation property of 
the log-Sobolev inequality. 

Proof of Theorem 2.L The proof is based on the approach of Bobkov and Ledoux 
[9] to the log-Sobolev inequality by mean of the Prekopa-Leindler Inequality, to¬ 
gether with an improved version of the Prekopa-Leindler Inequality of Bucur and 
Fragala [1 ]. Our starting point is the following result (see [11, Proposition 3.5]): 
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given a triple u, v, w: IS" —> K.+ with u, v e B n and 4 e [0,1] that satisfy w(Ax + 
(1 - A)y) > u(x) A v(y) 1 ~ A for all x,y e R", it holds 

(2.1) f w(x)dx - 1 > l 4 1+/1 (1 - A) 2 ~ A d n (u, vf. 

Jr" 2 

We stress that the constant A in the right hand side of the latter is not given explicitly 
in [11], but the reader can easily recover such a bound following carefully the proof 
of [ 1, Proposition 3.5]. (Inequality (2.1) goes back to the seventies [29, 35] and 
has numerous applications in convex geometry and functional analysis. We refer 
to the monographs [6, 17, 39] for an introduction. We further mention that equality 
cases are given in [15], and refer to Ball and Boroczky [3, 4] for related results on 
the stability of the Prekopa-Leindler Inequality.) 

Our aim is to apply (2.1) to a proper choice of triple u,v,w. Following [9], 
let / = e g with g sufficiently smooth with compact support and J R „ f dy n = 1, 
A e (0,1), and set 

u A (x) = Y — ^ X , v(y) = ip n (y) and w A (z) = e gAz) ip n (z) 

Jr n el - Ad 7n 


with 


gA(z) ■■ = 


sup 

j:,y: 

(1 —/f)A+/Iy=z 


g(x) - 


4(1 


— I*-y| 2 )-(1-4) log f 
/ Jr" 


e ‘-- 1 dy n 


The function g A is the optimal function such that it holds i-v,{(( I - A)x + Ay) > 
u / {(x') ] ~ A v(y) A . Set li A (z) := sup *, y: (g(.x) - y ^=^\x - y| 2 ). Then, by (2.1) 

above, we get 



e h l dy n > 



1 + -4 1+/1 (1 


A) 2 A d n (u A , v) 2 


The aim is to take the limit 4^0. We observe that (see [9] for details), as A tends 
to zero 



e 8 dy„ + AEnty n (e g ) + o(4) 


and 


f e llA dy n = f e 8 dy n + ——- f \Vgj 2 e 8 dy„ + o(A). 

J R" JR" 1 _ A ) J R" 

Therefore, dividing by 4, and taking the limit, we end up with 

liminf ^(l - A) 2 ~ /l d„(u A , v) 2 + Ent y „(e ? ) < y f \Vg\ 2 e g dy n . 
A^>0 Z Z J^n 


We are left with the study of liminf^o d„(u A , v) 2 , since lim ^o A a ( I - A) 2 ~ a = 1. 
For simplicity set u := uq (i.e. u is the function u A defined above with 4 = 0). By the 
Lebesgue Theorem we observe that, for any direction £ e S''” 1 , lim ^o(«,i)f = uy- 
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Hence, using again the Lebesgue Theorem (observe that, in the definition of d, 

|1 - r|/max(l,r) < 1) 

liminf d n {ii.\,v)> sup liminf d(fuf)^, vf) - sup d„(u%, (yp n )f) = d n (e g <p n , <p n )- 
/1 ^° feS "- 1 /l->0 ‘ feS "- 1 

The expected result follows for g sufficiently smooth. The result for a general 
/ e Pl„ follows by an easy approximation argument (using again the monotone 
convergence Theorem and the Lebesgue Theorem), details are left to the reader. 


□ 


Next we derive from Theorem 2.1 a lower bound on the log-Sobolev inequality, 
in dimension n, that involves n times the one dimensional distance d. Such a result 
will capture on one hand the product structure of the inequality, but on the other 
hand the deficit will no more be bounded by a distance (modulo translation). 

We need some notation. Given x - (jq,..., x n ) e R”, i e {1,..., n] and y- t e R, 
set x‘ := (xi,...,xi-i,x i+ i,...,x n ) and x l y { := (xi,... ,x ( -_i,y/,x (+ i, ..., x n ) (so 
that x‘xi = x). Then, for all functions /: R" —» R and all v e R", we denote by 
: R —> R the one dimensional function defined by / ? (>’ ( ) := f(x l yi), y/ e R 
(obviously f- x ,(xi) = f{x)). 

We may prove the following result. 

Corollary 2.2. For all n and all f e Fl n it holds 



Now, by construction, if f(x) = h(x\)li(x 2 ) ■ ■ ■ h(x n ), x = (xi ,..., x n ), with 
Iv. R — > R*, then both 6ls( f) and the right hand side of the latter are of (the 
correct) order n. 

Proof. The proof uses the tensorisation property of the entropy. It is well known 
(see e.g. [1, Chapter 1]) that for any /: R" —» R, it holds 



Hence, applying Theorem 2.1 n times, we get (since /', (x/) = §y(x)) 



<fifdy n -\{x l ) 



The expected result follows. 


□ 
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3. Stability of the Talagrand transport-entropy Inequality 

In this section we bound the deficit in the so-called Talagrand inequality, using 
again the distance d n introduced by Bucur and Fragala. Recall that (see e.g. [38]) 
the Kantorovich-Wasserstein distance W 2 is defined as 

W 2 (v,n) := inf | JJ " \x - y\ 2 n(dx, dy) 

where the infimum runs over all couplings n on R" x R” with first marginal v and 
second marginal p (i.e. n(W\ dy) = p(dy) and n(dx, IF 5 ") = v(dx)). Talagrand, in his 
seminal paper [ 37 ], proved the following inequality: for all probability measure v 
on R' ! , absolutely continuous with respect to y n , it holds 

(3.1) Wj(v,y n )<2H(v\y n ), 

where H(v\y n ) := j. log jjf dv if v « y n (whose density is denoted by dv/dy n ) 
and H(v\y) = +oo otherwise, is the relative entropy of v with respect to y n . Such an 
inequality, that is usually called Talagrand transport-entropy inequality, is related 
to Gaussian concentration in infinite dimension [37, 32, 19] (see the monographs 
[20, 28] for an introduction). It is known, since the celebrated work by Otto and 
Villani [34], that the log-Sobolev inequality (1.1) implies the Talagrand inequality 

(3.1) (in any dimension, see [8, 40, 5, 30, 21,22, 23, 18] for alternative proofs and 
extensions). 

The stability of (3.1) is also studied in [26, 16, 7, 14], We may obtain that, 
as a direct consequence of the transport of mass approach of (3.1) by Cordero- 
Erausquin [13] and of the tensorisation property, one can bound from below, as for 
the log-Sobolev inequality, the deficit in (3.1) by the distance d„ defined by Bucur 
and Fragala. 

Theorem 3.1. For all probability measure v on R" with continuous and positive 
density f e B n with respect to the Gaussian measure y n , it holds 

(3.2) StM := 2H(v\y n ) - W 2 2 (v, y n ) > ^d n (f<p n , <p n f. 

The above result together with theorem (2.1) somehow justify the use of the 
distance d n . As for Theorem 2.1 the bound on the deficit is one dimensional and 
thus not of the correct order. This fact may become clear to the reader through the 
proof: we use some tensorisation property but apply a bound on the deficit only to 
one single coordinate. 

Proof. The proof goes in two steps : we first prove the lower bound of the deficit 
in dimension 1, then we use a tensorisation procedure. 

From [ 1 3] we can extract the following one dimensional inequality (here /: R —> 

R*) 



draff) > f \r' -1 -log T]dy, 

Jr 
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where T = F v 1 o F y is the push forward of y onto v. Using that s - 1 - log s > 
\ (maxTTv)) an< ^ ^e Cauchy-Schwartz Inequality, we can conclude that 


6Talif ) ^ 


1 

2 


r I i - r 

Jr \max(l, 7 1 ') 


> 


1 / r u-H \ 2 

2 \J R max(l,rO 


1 

2 




which ends the proof of the first step (since d is symmetric). 

Next, recall the tensorisation property of the Kantorovich-Wasserstein metric 
and of the relative entropy: for /: K." —> R, v(dx) = f{x)dx, we have 


W 2 (v,y n )<W 2 (v u y) + 



W%(v Xu ... tXi ,y)dyi(xi,.. .,x { ) 


and 


n -1 p 

H(v\y„) = H(v\ I y) + V I H(y Xl tX . \y)dy i (x\x { ), 

tr 


where we used the desintegration formula 


v(dx i,..., dx n ) = vi(dxi)v Xl (dx 2 )v XuX2 (dx 3 ) x • • • x v Xu .^ Xn ^(dx n ). 

Now the supremum defining d n (f(p n , ip„) is reached at some (■' e S" _I that we may 
assume for simplicity and without loss of generality (since y n is invariant by ro¬ 
tation) to be the first unit vector of the canonical basis (1,0,... ,0). Using the 
tensorisation formulas above, applying the result we obtained in dimension 1, and 
(3.1) n - 1 times, we thus get 


n — 1 „ 

W 2 {v,y n ) < Wi(v u y) + V I Wf(v. Vl „.. jX ,, y)dy i (x ] ,..., xf) 

Jw 


< 2H(v] \y) - \d(fnp, ip) 2 + 2 V f H(v Xlt ... tX .\y)dy i (x 1 ,x { ) 

z i=i 

= 2H(v\y) - ^d{fiip,ip) 2 = 2H(v\y) - \ d n (fip n ,<Pn ) 2 » 


where we set f\ for the density of v\ with respect to y. By construction vj is the 
first marginal of v so that f\<p = ( ftp n )p- This ends the proof. □ 
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